Shaosheng Cao

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Feb 02, 2026

Balancing Understanding and Generation in Discrete Diffusion Models

Feb 01, 2026

Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training

Jan 31, 2026

Benchmarking Machine Translation on Chinese Social Media Texts

Jan 30, 2026

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Jan 29, 2026

One Token Is Enough: Improving Diffusion Language Models with a Sink Token

Jan 27, 2026

Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors

Jan 22, 2026

EComStage: Stage-wise and Orientation-specific Benchmarking for Large Language Models in E-commerce

Jan 06, 2026

RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services

Nov 10, 2025

Interleaving Reasoning for Better Text-to-Image Generation

Sep 09, 2025